Product-free subsets of groups, then and now 

Kir an S. Kedlaya 



Dedicated to Joe Gallian on his 65th birthday and the 30th anniversary of the Duluth REU 



1. Introduction 

Let G be a group. A subset S of G is product-free if there do not exist a, 6, c G S 
(not necessarily distinclQ) such that ab = c. 

One can ask about the existence of large product-free subsets for various groups, 
such as the groups of integers (see next section) , or compact topological groups (as 
suggested in For the rest of this paper, however, I will require G to be a finite 

group of order n > 1. Let a(G) denote the size of the largest product-free subset 
of G; put /3(G) = a(G)/n, so that /3(G) is the density of the largest product-free 
subset. What can one say about a(G) or /3(G) as a function of G, or as a function 
of n? (Some of our answers will include an unspecified positive constant; I will 
always call this constant c.) 

The purpose of this paper is threefold. I first review the history of this problem, 
up to and including my involvement via Joe Gallian's REU (Research Experience 
for Undergraduates) at the University of Minnesota, Duluth, in 1994; since I did 
this once already in [llj . I will be briefer here. I then describe some very recent 
progress made by Gowers [7]. Finally, I speculate on the gap between the lower 
and upper bounds, and revisit my 1994 argument to show that this gap cannot be 
closed using Gowers's argument as given. 

Note the usual convention that multiplication and inversion are permitted to 
act on subsets of G, i.e., for A, B C G, 

AB = {ab : a € A.b G B}, A~ x = {a^ 1 : a e A} . 

2. Origins: the abelian case 

In the abelian case, product-free subsets are more customarily called sum-free 
subsets. The first group in which such subsets were studied is the group of integers 
Z; the first reference I could find for this is Abbott and Moser pQ, who expanded 
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upon Schur's theorem that the set {1, . . . , ^nlej } cannot be partitioned into n sum- 
free sets. This led naturally to considering sum-free subsets of finite abelian groups, 
for which the following is easy. 

Theorem 2.1. For G abelian, (3(G) > |. 

Proof. For G = Z/pZ with p > 2, we have a(G) > [^\ by taking 



S = 



{ 


p + 1 


,...,2 


p+1 






3 


3 





Then apply the following lemma. □ 

Lemma 2.2. For G arbitrary, if H is a quotient ofG, then 

(3(G) > (3(H). 

Proof. Let S' be a product-free subset of H of size a(H). The preimage of 
S' in G is product-free of size #5"#G/#i?, so a(G) > a(H)#G/#H. □ 

In fact, one can prove an exact formula for a(G) showing that this construction 
is essentially optimal. Many cases were established around 1970, but only in 2005 
was the proof of the following result finally completed by Green and Ruzsa [8] . 

Theorem 2.3 (Green- Ruzsa). Suppose that G is abelian. 

(a) // n is divisible by a prime p = 2 (mod 3), then for the least such p, 

= § + §• 

(b) Otherwise, if 3\n, then a(G) = ^. 

(c) Otherwise, a(G) = ^ — for m the exponent (largest order of any 
element) ofG. 

One possible explanation for the delay is that it took this long for this subject 
to migrate into the mathematical mainstream, as part of the modern subject of 
additive combinatorics |15) . see Section 01 

The first appearance of the problem of computing a(G) for nonabelian G seems 
to have been in a 1985 paper of Babai and Sos [2]. In fact, the problem appears 
there as an afterthought; the authors were more interested in Sidon sets, in which 
the equation ab^ 1 = cc? _1 has no solutions with a,b,c,d taking at least three 
distinct values. This construction can be related to embeddings of graphs as induced 
subgraphs of Cayley graphs; product-free subsets arise because they relate to the 
special case of embedding stars in Cayley graphs. Nonetheless, the Babai-S6s paper 
is the first to make a nontrivial assertion about a(G) for general G; see Theorem l3.ll 

This circumstance suggests rightly that the product-free problem is only one 
of a broad class of problems about structured subsets of groups; this class can be 
considered a nonabelian version of additive combinatorics, and progress on problems 
in this class has been driven as much by the development of the abelian theory as by 
interest from applications in theoretical computer science. An example of the latter 
is a problem of Cohn and Umans [5] (see also [6]): to find groups G admitting large 
subsets Si, S2, S3 such that the equation ai&f^fr^ 1 ^^ 1 = e : w i tn a ii hi 6 S,, has 
only solutions with ai = bi for all i. A sufficiently good construction would resolve 
an ancient problem in computational algebra: to prove that two nx n matrices can 
be multiplied using 0(n 2+e ) ring operations for any e > 0. 
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3. Lower bounds: Duluth, 1994 

Upon my arrival at the REU in 1994, Joe gave me the paper of Babai and Sos, 
perhaps hoping I would have some new insight about Sidon sets. Instead, I took 
the path less traveled and started thinking about product-free sets. 

The construction of product-free subsets given in [2 is quite simple: if H is a 
proper subgroup of G, then any nontrivial coset of H is product-free. This is trivial 
to prove directly, but it occurred to me to formulate it in terms of permutation 
actions. Recall that specifying a transitive permutation action of the group G is 
the same as simply identifying a conjugacy class of subgroups: if H is one of the 
subgroups, the action is left multiplication on left cosets of H. (Conversely, given 
an action, the point stabilizers are conjugate subgroups.) The construction of Babai 
and Sos can then be described as follows. 

Theorem 3.1 (Babai-S6s). For G admitting a transitive action on {1, . . . , to} 
with to > 1, /3(G) > m . 

PROOF. The set of all g G G such that g(l) = 2 is product-free of size n/m. □ 

I next wondered: what if you allow g to carry 1 into a slightly larger set, say a 
set T of k elements? You would still get a product-free set if you forced each x G T 
to map to something not in T. This led to the following argument. 

Theorem 3.2. For G admitting a transitive action on {1, ... , to} with m > 1, 
/3(G) > cm- 1 / 2 . 

Proof. For a given k, we compute a lower bound for the average size of 
S= \J{g&G:g(l) = x}- \J {g G G : g(l), g(y) G T} 

for T running over fc-element subsets of {2, . . . , to}. Each set in the first union 
contains n/m elements, and they are all disjoint, so the first union contains kn/m 
elements. To compute the average of a set in the second union, note that for fixed 
g G G and y G {2, . . . , to}, a fc-element subset T of {1, . . . , m} contains g(l), y, g(y) 
with probability k i k ~ 1 )^ if two of the three coincide and fc / fc ~ 1 1 l'> fc ~ 2 L otherwise. 

1 J m(m — 1) m(m— l)(m— 2) 

A bit of arithmetic then shows that the average size of S is at least 

kn k 3 n 
m (to — 2) 2 

Taking k ~ (m/3) 1 / 2 , we obtain a(G) > cnjm 1 ! 2 . (For any fixed e > 0, the 
implied constant can be improved to e _1 — e for to sufficiently large; see the proof 
of Theorem 16.21 On the other hand, the proof as given can be made constructive 
in case G is doubly transitive, as then there is no need to average over T.) □ 

This gives a lower bound depending on the parameter m, which we can view as 
the index of the largest proper subgroup of G. To state a bound depending only on 
n, one needs to know something about the dependence of m on n; by Lemma F2.2[ 
it suffices to prove a lower bound on to in terms of n for all simple nonabelian 
groups. I knew this could be done in principle using the classification of finite 
simple groups (CFSG); after some asking around, I got hold of a manuscript by 
Liebeck and Shalev [12j that included the bound I wanted, leading to the following 
result from [10] . 
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Theorem 3.3. Under CFSG, the group G admits a transitive action on a set 
of size 1 < m < cn 3 / 7 . Consequently, Theorem \3.1\ implies a(G) > en 4 / 7 , whereas 
Theorem \3.2\ imvlies a(G) > cn 11 / 14 . 

At this point, I was pretty excited to have discovered something interesting 
and probably publishablc. On the other hand, I was completely out of ideas! I had 
no hope of getting any stronger results, even for specific classes of groups, and it 
seemed impossible to derive any nontrivial upper bounds at all. In fact, Babai and 
Sos suggested in their paper that maybe /3(G) > c for all G; I was dubious about 
this, but I couldn't convince myself that one couldn't have (3(G) > cn~ e for all 
e > 0. 

So I decided to write this result up by itself, as my first Duluth paper, and 
ask Joe for another problem (which naturally he provided). My paper ended up 
appearing as [10] : I revisited the topic when I was asked to submit a paper in 
connection with being named a runner-up for the Morgan Prize for undergraduate 
research, the result being [11] , 

I then put this problem in a mental deep freezer, figuring (hoping?) that my 
youthful foray into combinatorics would be ultimately forgotten, once I had made 
some headway with some more serious mathematics, like algebraic number theory 
or algebraic geometry. I was reassured by the expectation that the nonabelian 
product-free problem was both intractable and of no interest to anyone, certainly 
not to any serious mathematician. 

Ten years passed0 



4. Interlude: back to the future 

Up until several weeks before the Duluth conference, I had been planning to 
speak about the latest and greatest in algebraic number theory (the proof of Serre's 
conjecture linking modular forms and mod p Galois representations, recently com- 
pleted by Khare and Wintenberger) . Then I got an email that suggested that 
maybe I should try embracing my past instead of running from it. 

A number theorist friend (Michael Schein) reported having attended an al- 
gebra seminar at Hebrew University about product-free subsets of finite groups, 
and hearing my name in this context. My immediate reaction was to wonder what 
sclf-rcspccting mathematician could possibly be interested in my work on this prob- 
lem. The answer was Tim Gowers, who had recently established a nontrivial upper 
bound for a(G) using a remarkably simple argument. 

It seems that in the ten years since I had moved on to ostensibly more main- 
stream mathematics, additive combinatorics had come into its own, thanks partly 
to the efforts of no fewer than three Fields medalists (Tim Gowers, Jean Bourgain, 
and Terry Tao); some sources date the start of this boom to Ruzsa's publication in 
1994 of a simplified proof [14] of a theorem of Freiman on subsets of Z/pZ having 
few pairwise sums. In the process, some interest had spilled over to nonabelian 
problems. 



If you do not recognize this reference, you may not have read the excellent novel The 
Grasshopper King, by fellow Duluth REU alumnus Jordan Ellcnberg. 
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The introduction to Gowers's paper [7] cites! my Duluth paper as giving the 
best known lower bound on a(G) for general G. At this point, it became clear that 
I had to abandon my previous plan for the conference in favor of a return visit to 
my mathematical roots. 

5. Upper bounds: bipartite Cayley graphs 

In this section, I'll proceed quickly through Gowers's upper bound construction. 
Gowers's paper [7] is exquisitely detailed; I'll take that fact as license to be slightly 
less meticulous here. 

The strategy of Gowers is to consider three sets A, B, C for which there is no 
true equation ab — c with a £ A,b G B , c £ C , and give an upper bound on 
#A#B#C. To do this, he studies a certain bipartite Cayley graph associated to G. 
Consider the bipartite graph T with vertex set V\ U V<z , where each Vi is a copy of 
G, with an edge from x £ V\ to y £ Vi if and only if yx^ 1 £ A. We are then given 
that there are no edges between B C V\ and CC V 2 . 

A good reflex at this point would be to consider the eigenvalues of the adjacency 
matrix of T. For bipartite graphs, it is more convenient to do something slightly 
different using singular values; although this variant of spectral analysis of graphs 
is quite natural, I am only aware of the reference [3] from 2004 (and only thanks to 
Gowers for pointing it out). Let N be the incidence matrix, with columns indexed 
by V\ and rows by Vi, with an entry in row x and column y if xy is an edge of Y. 

Theorem 5.1. We can factor N as a product UT,V o/|Gx #G matrices over 
R, with U, V orthogonal and £ diagonal with nonnegative entries. (This is called a 
singular value decomposition of N.) 

PROOF. (Compare Theorem 2.6], or see any textbook on numerical linear 
algebra.) By compactness of the unit ball, there is a greatest A such that \\Nv\\ = 
A||v|| for some nonzero v £ R Vl . If v- w = 0, then f(t) = \\N(v + txv)\\ 2 has a local 
maximum at t — 0, so 

= — IIMv + tw)|| 2 = 2t(Nv) ■ (Nw). 
dt 

Apply the same construction to the orthogonal complement of Rv in M. Vl . Repeat- 
ing, we obtain an orthonormal basis of K^ 1 ; the previous calculation shows that the 
image of this basis in MX 2 is also orthogonal. Using these to construct V, U yields 
the claim. □ 

The matrix M — NN T is symmetric, and has several convenient properties. 

(a) The trace of M equals the number of edges of T. 

(b) The eigenvalues of M are the squares of the diagonal entries of S. 

(c) Since T is regular of degree #A and connected, the largest eigenvalue of 
M is #A, achieved by the all-ones eigenvector 1. 

Lemma 5.2. Let A be the second largest diagonal entry o/S. Then the set W 
of v G K^ 1 with v • 1 = and ||ATv|| = A||v|| is a nonzero subspace ofM. Vl . 



J Since Joe is fond of noting "program firsts" , I should point out that this appears to be the 
first citation of a Duluth paper by a Fields medalist. To my chagrin, I think it is also the first 
such citation of any of my papers. 
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Proof. (Compare [3 Lemma 2.7].) From Theorem 15. 1[ we obtain an orthog- 
onal basis vi, . . . ,v„ of R^ 1 , with Vi = 1, such that ATvi, . . . ,Nv n are orthogonal, 
and ||A/Vi||/||vi||, . . . , ||iVv n ||/||v n || are the diagonal entries of E; we may then 
identify W as the span of the Vj with i > 1 and ||ATvj|| = A||v,||. 

Alternatively, one may note that W is obviously closed under scalar multipli- 
cation, then check that W is closed under addition as follows. If vi,V2 G W, then 
|| JV(vi =h V2 ) | < A||vi ± v"2 1| , but by the parallelogram law 

\\Nv! + Nv 2 \\ 2 + \\Nv! - Nv 2 \\ 2 = 2||7V Vl || 2 + 2||7Vv 2 || 2 

= 2A 2 ||v 1 || 2 + 2A 2 ||v 2 || 2 

-A 2 ||v 1 +v 2 || 2 + A 2 ||v 1 -v 2 || 2 . 

Hence ||JV(v a ±v 2 )|| = A||v x ±v 2 ||. □ 

Gowers's upper bound on a(G) involves the parameter S, defined as the smallest 
dimension of a nontrivial representation^ of G. For instance, if G — PSL 2 (g) with 
qodd, then then 6 = (q — l)/2. 

Lemma 5.3. J/v G M. Vl satisfies v 1 = 0, then \\Nv\\ < (n^A/Sf^W v|| . 

PROOF. Take A, W as in Lemma f5T2l Let G act on V\ and V 2 by right multi- 
plication; then G also acts on T. In this manner, W becomes a real representation 
of G in which no nonzero vector is fixed. In particular, dim(M / ) > 5. 

Now note that the number of edges of M, which is n#A, equals the trace of M, 
which is at least dim(W^)A 2 > <5A 2 . This gives A 2 < n#A/5, proving the claim. □ 

We are now ready to prove Gowers's theorem [7j Theorem 3.3]. 

Theorem 5.4 (Gowers). If A,B,C are subsets of G such that there is no true 
equation ab = c with a G A,b G B,c G C, then ^Ajj^Bjj^C < n 3 /S. Consequently, 
(3(G) < S- 1 / 3 . 

For example, if G = PSL 2 (q) with q odd, then n ~ cq 3 , so a(G) < cn 8 ^ 9 . On 
the lower bound side, G admits subgroups of index m ~ cq, so a(G) > cn 5 ^ 6 . 

Proof. Write #A = m, #_B = sn, #C = tn. Let v be the characteristic 
function of B viewed as an element of M. Vl , and put w = v — si. Then 

w • 1 = 

w • w = (1 - s) 2 #B + s 2 (n - #B) = s(l - s)n < sn, 

so by Lemma f5 .31 1 1 A^w| | 2 < rn 2 sn/S. 

Since ab — c has no solutions with a G A, b G B,c G C, each element of C 
corresponds to a zero entry in TV v. However, JVv = TVw + rsnl, so each zero entry 
in Nv corresponds to an entry of Nw equal to —rsn. Therefore, 

(tn)(rsn) 2 < \\Nw\\ 2 < rsn 3 /6, 

hence rstS < 1 as desired. □ 

As noted by Nikolov and Pyber [13] . the extra strength in Gowers's theorem 
is useful for other applications in group theory, largely via the following corollary. 



One could just as well restrict to real representations, which would increase 6 by a factor of 
2 in some cases. For instance, if G = PSL2(q) with q = 3 (mod 4), this would give 5 = q — 1. 
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Corollary 5.5 (Nikolov-Pyber) . If A, B, C are subsets ofG such that ABC ^ 
G, then #A#B#C < n 3 /S. 

Proof. Suppose that #A#B#C > n 3 /5. Put D = G\ AB, so that #D = 
n - #(AB). By Theorem E3 we have #A#B#D < n 3 /5, so #C > #D. Then 
for any g G C, the sets AB and gC~ x have total cardinality more than n, so they 
must intersect. This yields ABC = G. □ 

Gowers indicates that his motivation for this argument was the notion of a 
quasi-random graph introduced by Chung, Graham, and Wilson [4]. They show 
that (in a suitable quantitative sense) a graph looks random in the sense of having 
the right number of short cycles if and only if it also looks random from the spectral 
viewpoint, i.e., the second largest eigenvalue of its adjacency matrix is not too large. 



6. Coda 



As noted by Nikolov and Pyber [13j . using CFSG to get a strong quantitative 
version of Jordan's theorem on finite linear groups, one can produce upper and 
lower bounds for a(G) that look similar. (Keep in mind that the index of a proper 
subgroup must be at least (5+1, since any permutation representation of degree m 
contains a linear representation of dimension m — 1.) 

Theorem 6.1. Under CFSG, the group G has a proper subgroup of index at 
most cS 2 . Consequently, 

cn/S < a(G) < cn/6 1/3 . 

Moreover, for many natural examples (e.g., G = A m or G = PSL^g)), G has 
a proper subgroup of index at most cS, in which case one has 

cn/5 1/2 < a(G) < cn/S 1/3 . 

Since the gap now appears quite small, one might ask about closing it. However, 
one can adapt the argument of [lOj to show that Gowers's argument alone will not 
suffice, at least for families of groups with m < cS. (Gowers proves some additional 
results about products taken more than two at a time §5]; I have not attempted 
to extend this construction to that setting.) 

Theorem 6.2. Given e > 0, for G admitting a transitive action on {1, . . . , to} 
for m sufficiently large, there exist A,B,C C G with (ffA){f^B)(ffC) > (e _1 — 
e)n 3 /m, such that the equation ab = c has no solutions with a G A,b G B,c G C. 
Moreover, we can force B = C , C = A, or A = B^ 1 if desired. 

Proof. We first give a quick proof of the lower bound cn 3 /m. Let U, V be 
subsets of {1, ... , m} of respective sizes u, v. Put 



A = {g e G 
B = {geG 
C = {geG 



g(U) n V = 0} 

5(1) e u} 

.9(1) G V}; 



then clearly the equation ab = c has no solutions with a G A, b G B, c G C. On the 
other hand, 

, vn „ „ un ,, vn 

#A>n~u—, #B = — , #C=— , 
m m m 
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and so 

m)mm c)>-(-) (i--). 

in V Tfi J V m J 
By taking u,v = [y/m/2\, we obtain (#A)(#B)(#C) > cn 3 /m. 

To optimize the constant, we must average over choices of U, V. Take u,v — 
\_\frn\ ■ By inclusion-exclusion, for any positive integer h, the average of is 
bounded below by 

2/i-l 

(-i)V 

(The i-th term counts occurrences of z-element subsets in g(U) fl V. We find (") i- 
element sets inside V; on average, each one occurs inside g{U) for / ( 7 ™) choices 
of g.) Rewrite this as 

»(|Vi)<™^ + »(A 



\p ( - 1) ■ ■ ■ (m - ^ + j>(> - 1) • ■ • (v - i + 1) 

^— ' z!m(m —!)■■• (m — i + 1) 



where o(l) — > as m — > oo. For any e > 0, we have 

771^ 

for ft, sufficiently large, and m sufficiently large depending on h. This gives the 
desired lower bound. 

Finally, note that we may achieve B = C by taking U — V. To achieve the 
other equalities, note that if the triplet A, B, C has the desired property, so do 
B-\ A- 1 ,^ 1 and C\B-\A. □ 

I have no idea whether one can sharpen Theorem 15.41 under the hypothesis 
A = B = C (or even just A = B). It might be enlightening to collect some 
numerical evidence using examples generated by Theorem 13. 2\ with Xuancheng 
Shao, we have done this for PSL2(g) for q < 19. 

I should also mention again that (as suggested in [11] ) one can also study 
product-free subsets of compact topological groups, which are large for Haar mea- 
sure. Some such study is implicit in §4], but we do not know what explicit 
bounds come out. 
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